Korpus: pol_newscrawl_2011_1M

Weitere Korpora

3.7.1 String similarity graph for words

General information for Levenshtein distance for words. The data are considered as graph with words as nodes and edges weighted with Levenshtein similarity. Only the top 1.000.000 words are considered.

Number of nodes in the top-1M Levenshtein graph
Number of nodes
458974
Number of edges in the top-1M Levenshtein graph
Number of edges
3206602
Minimum word length
Word length
6
Edge weights in the top-1M Levenshtein graph
Levenshtein distance Number of edgeses
0 74953
1 693841
2 2437808
4747 msec needed at 2018-03-20 03:03